Feeds to Scour
Scoured 9589 posts in 2.93 s
Poor neuro-motor tuning of the human larynx: a comparison of sung and whistled pitch imitation
pmc.ncbi.nlm.nih.gov·19h·
Discuss: Hacker News
🎧Learned Audio
Stanford CS 224N | Natural Language Processing with Deep Learning
web.stanford.edu·1d
🧠Machine Learning
MUSAN: A Music, Speech, and Noise Corpus
dev.to·1d·
Discuss: DEV
🎧Learned Audio
Meta brings Segment Anything to audio, letting editors pull sounds from video with a click or text prompt
the-decoder.com·1d
🎧Audio Restoration
wwes4/AI_Accel_1.5x: AI acceleration framework for ~1.5x speedups in mid-sized models via tension-based pruning. Built utilizing xAI's Grok.
github.com·1d·
Discuss: Hacker News
📊Quantization
Large Language Models Approach Expert Pedagogical Quality in Math Tutoring but Differ in Instructional and Linguistic Profiles
arxiv.org·2d
🤖Grammar Induction
TRUNAJOD: A text complexity library for text analysis built on spaCy — TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io·5h
📝Parsing Grammars
The Transformer Architecture: A Deep Dive into How LLMs Actually Work
dev.to·51m·
Discuss: DEV
📝Text Parsing
mgsgde/whisper-shortcut: Speech-to-text and voice-to-prompt macOS app with Gemini and Whisper support
github.com·2d·
Discuss: Hacker News
🎙️Whisper
How to Prioritize Naturalness in Voice Cloning for Brand-Aligned Tones
dev.to·3d·
Discuss: DEV
🎙️Whisper
Language Log » Name-transcription slop
languagelog.ldc.upenn.edu·6d
🗜️Zstandardized Archives
Show HN: Languagecat, a free dataset for people making language-learning apps
language.cat·3d·
Discuss: Hacker News
🤖Grammar Induction
🧠 Beyond Chatbots: Building 'Echo-Learn', an Agentic AI Tutor with Biological Memory
dev.to·1d·
Discuss: DEV
🎙️Whisper
VALLR-Pin: Dual-Decoding Visual Speech Recognition for Mandarin with Pinyin-Guided LLM Refinement
arxiv.org·3d
🎙️Whisper
What Is ChatGPT Doing?
vibediary.dev·2d·
Discuss: Hacker News
🧠Neural Codecs
MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model
arxiv.org·2d
🔍Information Retrieval
Toward Human-Centered AI-Assisted Terminology Work
arxiv.org·4d
🤖AI Translation
Why Custom Evals Matter for Production LLMs
randalolson.com·3d·
Discuss: Hacker News
📏Code Metrics
FluencyVE: Marrying Temporal-Aware Mamba with Bypass Attention for Video Editing
arxiv.org·2d
🗜️LZW Variants
Foundation Model-based Evaluation of Neuropsychiatric Disorders: A Lifespan-Inclusive, Multi-Modal, and Multi-Lingual Study
arxiv.org·2d
🎙️Whisper